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QUANTIZATION NOISE SHAPING METHOD AND APPARATUS 

BACKGROUND OF THE INVENTION 
[01] This application claims the priority of Korean Patent Application No, 
2003-2718, filed on January 15, 2003, in the Korean Intellectual Property 
Office, the disclosure of which is incorporated herein in its entirety by 
reference. 

1 . Field of the Invention 

[02] The present invention relates to compression of audio data, and more 
particularly, to a method and apparatus for shaping quantization noise 
generated when compressing audio data at a low bit rate. 

2. Description of the Related Art 

[03] Compression of audio data is achieved by performing sampling, 
quantizing, encoding, and so forth. Quantization refers to expressing sampled 
signal values as stepped integers to represent the sampled values as 
predetermined representative values. Such a quantization process generates 
quantization noise. The quantization noise is an error component between an 
original signal and a quantized signal and is attenuated with an increase in a 
number of bits used for the quantization process. In quantization according to 
the Moving Picture Experts Group (MPEG), which are standards for coded 
representation of moving pictures and digital audio, a factor generated by a 



Discrete Cosine Transform (DCT) or a Modified DCT (MDCT) is divided by 
a predetermined value to express the factor as a low factor value so £is to 
reduce an encoding amount. 

[04] Audio data should be compressed in consideration of the properties of 
the human auditory system. In general, one sound cannot be heard when a 
much louder sound is present. For example, if a person in an office speaks 
loudly, the others in the office can easily perceive who is speaking. However, 
if an airplane passes over the office building, the listeners cannot hear at all 
what the speaker is saying. In addition, after the airplane passed over the 
building, the listeners still cannot hear what the speaker is saying due to the 
lingering sound of the airplane. This is called a masking effect. 
[05] FIG, 1 illustrates the masking effect. Referring to FIG. 1, let us 
assume that an audio fi-equency contains a masking curve 130 indicating a 
sound energy level at which the average human can hear a sound. Since an 
audio signal A 1 10 has a sound energy level above the masking curve 130, the 
audio signal A 110 is audible to the average human. In contrast, since an 
audio signal B 120 has a sound energy level below the masking curve 130, the 
audio signal B 120 is inaudible to the average human. 

[06] Psychoacoustic model quantization refers to the quanitzation of only 
audio data with a sound energy level above a masking threshold by sectioning 
an audio fi-equency into fi-equency bands at predetermined intervals. The 
psychoacoustic model quantization is used in compression standards such as 
MPEG. However, in a case where audio data is compressed at a low bit rate 



below 64Kbps, a number of bits used for quantization is limited. Thus, a 
general compression technique according to MPEG standards is not suitable 
for an effective compression of an audio signal. 

[07] FIGS. 2 A and 2B show a quantization noise spectrum with respect to a 
frequency, the spectrum being generated after performing quantization. 
[08] In a psychoacoustic model, an audio signal is received, and then a Fast 
Fourier Transform (FFT) is performed to calculate and output a quantization 
threshold 210 in each frequency band. The quantization threshold 210 may be 
calculated so that the average human cannot discem between an original signal 
and a quantized signal. A quantization threshold in actual quantization may 
appear as reference numeral 210 or 240. If the quantization threshold 210 is 
obtained in the actual quantization, quantization noise may fall within the 
quantization threshold 210 according to the psychoacoustic model, which does 
not affect sound quality. If the quantization threshold 240 is obtained in the 
actual quantization, sound quality degrades. Thus, quantization noise has to 
be shaped so as to fall within the quantization threshold 210. However, since 
a low bit rate audio signal is expressed and quantized with a limited number of 
bits, quantization noise cannot always be shaped within a quantization 
threshold. 

[09] Accordingly, a conventional quantization algorithm used for the 
compression of an audio signal uses a simple way to confine a number of 
times quantization noise is shaped so that the shaping of the quantization noise 
ends when quantization noise cannot be below a quantization threshold 
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calculated in the psychoacoustic model. The confinement may allow the 
quantization noise to have a predetermined shape, which causes the 
quantization noise to exceed the quantization threshold in a predetermined 
number of frequency bands. As a result, sound quality deteriorates. 

SUMMARY OF THE INVENTION 
[10] The present invention provides a quantization noise shaping method 
and apparatus, by which the distortion of audio data can be reduced by shaping 
quantization noise generated during quantization of low bit rate audio data so 
that a quantization noise curve is similar to a quantization threshold curve 
calculated in a psychoacoustic model even though the quantization noise is 
above the quantization threshold in all frequency bands. 

[11] According to an aspect of the present invention, there is provided a 
method of shaping quantization noise. A predetermined quantization noise 
threshold allowed during quantization of sampled audio data and quantization 
noise energy information of a quantized MDCT coefficient are received in all 
frequency bands of an audio frequency. The quantization noise energy of the 
quantized MDCT coefficient is attenuated in a predetermined number of 
frequency bands in which a difference between the predetermined quantization 
noise threshold and the quantization noise energy of the quantized MDCT 
coefficient is large. 

[12] According to another aspect of the present invention, there is provided 
a method of shaping quantization noise. During compression of an audio 
signal at a predetermined bit rate, a determination is made as to whether 



4 



quantization noise in all frequency bands falls below a threshold noise level 
calculated in a psychoacoustic model. If the quantization noise does not fall 
below the threshold noise level, quantization noise is shaped in each of the 
frequency bands to be equal to the threshold noise level, with an offset error. 
[13] According to still another aspect of the present invention, there is 
provided a method of shaping quantization noise. Total quantization noise of 
a quantized MDCT coefficient and a sum of quantization noise thresholds 
calculated in a psychoacoustic model are calculated. The total quantization 
noise of the quantized MDCT coefficient is compared with the sum of the 
quantization noise thresholds. If the total quantization noise of the quantized 
MDCT coefficient is less than the sum of the quantization noise thresholds, 
quantization noise is attenuated in every frequency band, while if the total 
quantization noise of the quantized MDCT coefficient is greater than the sum 
of the quantization noise thresholds, quantization noise is attenuated in 
selected frequency bands. 

[14] According to yet another aspect of the present invention, there is 
provided an apparatus for adjusting a quantization noise distribution. The 
apparatus includes a quantization noise calculator that calculates total 
quantization noise of a quantized MDCT coefficient and a sum of quantization 
noise thresholds calculated in a psychoacoustic model, a noise attenuation 
algorithm selector that compares the total quantization noise of the quantized 
MDCT coefficient with the sum of the quantization noise thresholds to 
determine whether a quantization noise attenuation is performed in every 
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frequency bands or in selected frequency bands, a quantization noise 
attenuator that attenuates quantization noise in every frequency band, and a 
band selective quantization noise attenuator that attenuates quantization noise 
in selected frequency bands. 

[15] According to yet another aspect of the present invention, there is 
provided a computer-readable recording medium on which a program for 
executing the method of the present invention in a computer is recorded. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[16] The above and other features and advantages of the present invention 
will become more apparent by describing in detail exemplary embodiments 
thereof with reference to the attached drawings in which: 
[17] FIG. 1 illustrates a masking effect; 

[18] FIG. 2A and 2B show a quantization noise spectrum with respect to a 
frequency, the spectrum being generated after performing quantization; 
[19] FIG. 3 is a block diagram of a quantization noise shaping apparatus; 
[20] FIG. 4 is a flowchart of a method of shaping quantization noise; 
[21] FIGS. 5A and 5B illustrates shaping of noise energy of a quantized 
MDCT coefficient by adjusting a scale factor band gain in each frequency 
band; 

[22] FIG. 6 illustrates a process of selectively increasing a scale factor band 
gain in each frequency bandwidth; 

[23] FIG. 7 is a flowchart of a method of reducing quantization noise 
according to the present invention; and 



[24] FIG. 8 is a block diagram of a quantization noise attenuator according 
to the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
[25] FIG. 3 is a block diagram of a quantization noise shaping apparatus. A 
quantizer for an MPEG audio encoder includes a bit rate controller 310 which 
controls a bit rate, a quantization noise calculator 320 which calculates 
quantization noise energy, a scale factor band gain adjuster 330 which 
compares the quantization noise energy received from the quantization noise 
calculator 320 with a quantization noise threshold received from a 
psychoacoustic model and adjusts a scale factor band gain given to each 
frequency band to shape a quantization noise curve in each frequency band, 
and a determiner 340 which transmits a command to reset a number of bits to 
the bit rate controller 310 and determines whether to end a quantization 
process under a predetermined condition. The operations of the above 
components are described in detail in the MPEG standards (ISO 14496-3 
Annex B). 

[26] The bit rate controller 310 receives an audio frame, quantizes an 
MDCT coefficient of the received audio frame, Hufftnan-codes the 
quantization result, and calculates a number of bits used during the Huffinan- 
coding. In other words, the bit rate controller 310 calculates a number of bits 
corresponding to a bit rate determined for coding of an audio signal and 
adjusts the number of bits until a number of bits smaller than the calculated 
number of bits can be used for coding, by adjusting a common gain. 



[27] When a quantized MDCT coefficient is denoted as Xquanb a received 
MDCT coefficient is denoted as mdct_line, and a scale factor is denoted as sf, 
the quantized MDCT coefficient Xquant is calculated as in Equation 1 : 

1 Zl(j/_i00) 

^mdctjine '2'^ ...(1) 

[281 The scale factor sf is calculated as in Equation 2: 

sf=common_gain - sjb_gain(sjb) . . .(2) 
wherein common_gain is the common gain that is used to satisfy a given 
number of bits in the audio fi-ame and is determined by an intemal loop that 
adjusts a number of bits to be used to a predetermined bit rate, and sfl5_gain is 
the scale factor band gain indicating a degree to which a scale factor is 
adjusted to shape the quantization noise and is determined by an external loop 
that selectively adjusts the scale factor band gain sfb gain in each fi"equency 
band. Consequently, sfb_gain is expressed as a fiinction of the sib. As can be 
seen in Equations 1 and 2, the common gain common_gain should be low and 
the scale factor band gain sfl3_gain should be large in order that an error 
between the quantization MDCT coefficient Xquant and the received MDCT 
coefficient mdct_line is low. 

[29] The quantization noise calculator 320 calculates quantization noise in 
each fi*equency band using the error between the quantization MDCT 
coefficient Xquant and the received MDCT coefficient mdct_line. 
[30] The scale factor band gain adjuster 330 compares the quantization 
noise received firom the quantization noise calculator 320 with the quantization 
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noise threshold received from psychoacoustic model to adjust a quantization 
noise level in each frequency band. The adjustment of the quantization noise 
level in each frequency band is achieved by adjusting the scale factor band 
gain. 

[31] The determiner 340 adjusts the scale factor to shape the quantization 
noise and then makes a determination whether to end the quantization process 
by determining whether the adjusted scale factor band gain has been amplified 
to a predetermined maximum value, whether differences among the scale 
factor band gains adjusted in frequency bands are greater than a predetermined 
reference value or whether quantization noise in every frequency band is 
lower than the quantization noise threshold calculated in the psychoacuostic 
model. 

[32] In a conventional quantization noise shaping method, a common gain 
commonly applied to every frequency band is adjusted to perform an intemal 
loop that adjusts a number of bits to be used to a predetermined bit rate and an 
external loop that adjusts a scale factor band gain used for shaping of the level 
of quantization noise in each frequency band. In the extemal loop, a number 
of bits allocated to each frequency bandwidth are summed, a common gain is 
increased to reduce a used number of bits if the summed value is greater than a 
predetermined threshold, to which the scale factor band gain adjusted in each 
frequency band is encoded, to be less than a predetermined threshold, and the 
scale factor band gain is increased in each frequency band to a predetermined 
value so that the scale factor band gain stays below a predetermined threshold 



in each frequency band. The external loop is repeated until the quantization 
noise in every frequency band falls within the quantization noise threshold. 
[33] FIG. 4 is a flowchart of a method of shaping quantization noise. The 
method includes: calculating a number of bits corresponding to a 
predetermined bit rate at which an audio signal is to be coded and adjusting a 
common gain until a number of bits smaller than the calculated number of bits 
is used for the coding of the audio signal so as to adjust the number of bits 
used for the coding. 

[34] In step S4I0, a bit rate is controlled. In other words, an audio frame is 
received and then an MDCT coefficient of the audio frame is quantized. Next, 
the quantized MDCT coefficient is Huffinan-coded, and then a number of bits 
used for the Huffman-coding is calculated. In other words, a number of bits 
corresponding to a predetermined bit rate at which an audio signal is to be 
coded is calculated, and then a common gain is adjusted to adjust the number 
of bits until a number of bits smaller than the calculated number of bits is used 
for the Huffman-coding. For example, when a frame of an audio signal is 
sampled 1024 times at 44.1 KHz, a number of bits used for coding the 1024 
frame samples at 128 kbps is calculated as in Equation 3, and a common gain 
is adjusted to be less than the calculated number of bits: 

-129^ X 1 28,000 = 2,972 . . .(3) 
44,100 

[35] In step S420, quantization noise energy is calculated in all frequency 
bands of an audio frequency. In other words, the magnitude of quantization 
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noise energy in each frequency band is calculated using a difference between a 
received MDCT coefficient mdct_line and a quantized MDCT coefficient 
Xquant- In Step S430, scale factors used for the calculation of the magnitude of 
the quantization noise energy are stored. In step S440, a determination is 
made whether the calculated magnitude of the quantization noise energy is 
greater than the quantization noise threshold calculated in the psychoacoustic 
model. If the quantization noise energy, is greater than the quantization noise 
threshold, noise energy of the quantized MDCT coefficient Xquant is reduced. 
The reduction in the noise energy of the quantized MDCT coefficient may be 
achieved by adjusting the scale factor band gain. 

[36] FIGS. 5A and 5B illustrate the adjustment of the noise energy of the 
quantized MDCT coefficient through the adjustment of the scale factor band 
gain in each frequency band. 

[37] Let us assume that the quantization noise energy of the quantized 
MDCT coefficient appears as reference numeral 520 of FIG. 5A. As can be 
seen in FIG. 5A, since the quantization noise energy of the quantized MDCT 
is greater than a quantization noise threshold 510 calculated in the 
psychoacoustic model, in step S450, the scale factor band gain is adjusted in 
each frequency band. In step S460, a determination is made whether the scale 
factor band gain in every fi-equency band has been increased. If the scale 
factor band gain in every frequency band has been increased, a determination 
is made that a desired sound quality requirement is not satisfied at a given bit 
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rate, and the shaping of the quantization noise ends using the scale factors 
stored in step S430. If not, a next step is performed. 

[38] The adjustment of the scale factor band gain may result in the shaping 
of the quantization noise as indicated by arrows 530 or 540. However, the 
scale factor band gain is increased to a limit. Thus, in step S470, a 
determination is made whether the quantization noise is shaped to fall within 
the quantization noise threshold 510 only when the scale factor band gain is 
increased to be above a predetermined threshold value. If the determination is 
made that the quantization noise is shaped to fall within the quantization noise 
threshold 510 only when the scale factor band gain is increased to be above 
the predetermined threshold value, in step S490, a determination is made that a 
desired sound quality is not satisfied at the given bit rate and the shaping of 
the quantization noise ends using the stored scale factors. If not, a next step is 
performed. 

[39] In step S480, a determination is made whether quantization noise in at 
least one fi^equency band is above the quantization noise threshold. If the 
determination is made that the quantization noise in the at least one frequency 
band is above the quantization noise threshold, step S410 restarts to readjust 
the number of bits. In other words, the number of bits increases little by little 
so that the number of bits is below a threshold value. 

[40] FIG. 6 illustrates a process of selectively increasing the scale factor 
band gain in each frequency band. As shown in FIG. 6, a threshold 610 is 
calculated in a psychoacoustic model. Noise energy 620 of a quantized 
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MDCT coefficient is calculated. A quantization error is reduced in a 
predetermined number of frequency bands in which a difference between the 
threshold 610 and the noise energy 620 of the quantized MDCT coefficient is 
great. The difference is the greatest in a frequency band 1 640, a fi-equency 
band 2 650, and a fi-equency band 3 660. Thus, the quantization noise is first 
reduced in the frequency band 1 640, the frequency band 2 650, and the 
frequency band 3 660. In other words, instead of reducing quantization noise 
in every frequency band, a process of reducing noise energy of a quantized 
MDCT coefficient in a predetermined number of particular frequency bands is 
repeated so that the same amount of quantization error occurs in all the 
frequency bands. 

[41] In a method of shaping quantization noise in the compression of 
MPEG audio data, according to the present invention, an allowed bit rate is 
too low for quantization noise to be below a threshold noise level calculated in 
a psychoacoustic model. Nevertheless, a scale factor band gain adjuster can 
variably adjust a scale factor band gain according to the MPEG standards in 
order to shape the quantization noise in each frequency band to the threshold 
noise level in each frequency band in the psychoacoustic model. 
[42] The conventional method separately performs an external loop for each 
frequency to increase the scale factor band gain in each frequency band by 
comparing the quantization noise in each frequency band with the quantization 
noise threshold. However, in the present invention, instead of comparing 
quantization noise with quantization noise threshold in an external loop 
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through which a scale factor band gain is adjusted, the external loop ends after 
first adjusting the scale factor band gain all the frequency bands in which 
quantization noise is the highest according to the ranking of noise-to-mask 
ratios (NMRs) in the frequency bands. 

[43] FIG. 7 is a flowchart of a method of attenuating quantization noise 
according to the present invention. 

[44] In step S710, total quantization noise of a quantized MDCT coefficient 
and a total sum of quantization noise thresholds calculated in a psychoacoustic 
model are calculated. In step S720, the total quantization noise of the 
quantized MDCT coefficient is compared with the total sum of the 
quantization noise thresholds. If the total quantization noise of the quantized 
MDCT coefficient is less than the total sum of the quantization noise 
thresholds, in step S730, the quantization noise is attenuated according to an 
existing method. If the total quantization noise of the quantized MDCT 
coefficient is greater than the total sum of the quantization noise thresholds, in 
step S740, the quantization noise is selectively attenuated in each firequency 
band. In other words, an extemal loop ends after adjusting a scale factor band 
gain in fi*equency bands of all frequency bands in which quantization noise is 
higher than a quantization noise threshold according to the ranking of NMRs 
in all the frequency bands. A process of attenuating quantization noise in the 
entire firequency band is as described with reference to FIG. 4. 
[45] FIG. 8 is a block diagram of a quantization noise attenuating apparatus 
according to the present invention. Referring to FIG. 8, the quantization noise 
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attenuating apparatus includes a quantization noise calculator 810, a noise 
attenuation algorithm selector 820, a quantization noise attenuator 830, and a 
band selective quantization noise attenuator 840. 

[46] The quantization noise calculator 810 calculates total quantization 
noise of a quantized MDCT coefficient and a sum of quantization noise 
thresholds calculated in a psychoacoustic model. 

[471 The noise attenuation algorithm selector 820 compares the total 
quantization noise value of the MDCT coefficient with the sum of the 
quantization noise thresholds to determine whether a quantization noise 
attenuation is performed in all fi*equency bands or in selected particular 
firequency bands. 

[48] The quantization noise attenuator 830 attenuates quantization noise in 
all the fi-equency bands. In other words, when a predetermined bit rate is 
determined to compress an audio signal, the quantization noise attenuator 830 
calculates a number of bits corresponding to the predetermined bit rate, adjusts 
the number of bits by adjusting a common gain until a number of bits smaller 
than the calculated number of bits is used for the compression, and adjusts a 
degree to which quantization noise is attenuated in each firequency band by 
adjusting a scale factor band gain. Details of this are as described with 
reference to FIG. 4. 

[49] The band selective quantization noise attenuator 840 attenuates 
quantization noise in selected frequency bands. In other words, the band 
selective quantization noise attenuator 840 adjusts scale factors in a 
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predetermined number of frequency bands according to the ranking of NMRs 
of the number of frequency bands in which the quantization noise of the 
quantized MDCT coefficient is greater than the quantization noise threshold in 
the psychoacoustic model. 

[50] As described above, according to the present invention, even if an 
allowed bit rate disables quantization noise to fall below a quantization noise 
threshold obtained from a psychoacoustic model, an envelope of the 
quantization noise can be shaped to be equal to a curve of the quantization 
noise threshold. Thus, quantization noise in each frequency band is equally 
above the quantization noise threshold. As a result, unlike the prior art, the 
present invention can prevent quantization noise threshold in particular 
frequency bands from excessively going beyond the quantization noise. This 
results in an improvement of sound quality. 

[51] In quantization for existing MPEG audio compression, a limited 
number of bits is ineffectively allocated, which directly affects deterioration of 
sound quality. However, in the present invention, with selective adoption of 
the prior art bit allocation method, if frequency bands in which quantization 
noise is to be attenuated are many at a low bit rate, qiiantization noise is 
attenuated in frequency bands corresponding to a predetermined bit rate 
instead of attenuating quantization noise in all frequency bands. Even though 
this quantization process does not allow quantization noise in all frequency 
bands to fall below the quantization noise threshold, the quantization noise can 
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be shaped to be similar to the quantization noise threshold. As a result, sound 
quality can be improved. 

[52] The present invention can be realized as a computer-readable code on a 
computer-readable recording medium. Computer-readable recording media 
include recording apparatuses storing computer-readable data. Computer- 
readable recording media include ROMs, RAMs, CD-ROMs, magnetic tapes, 
floppy discs, optical data storage devices, and carrier waves (e.g., transmission 
over the Internet). The computer-readable recording media can also store and 
execute a computer-readable code in computers connected via a network in a 
dispersion way. 

[53] While the present invention has been particularly shown and described 
with reference to exemplary embodiments thereof, it will be understood by 
those of ordinary skill in the art that various changes in form and details may 
be made therein without departing from the spirit and scope of the present 
invention as defined by the following claims. 
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